Skip to content

[SPARK-56998]Add SECURITY.md + AGENTS.md Security section for scan-agent discoverability#55933

Closed
potiuk wants to merge 3 commits into
apache:masterfrom
potiuk:asf-security/discoverability-2026-05-18
Closed

[SPARK-56998]Add SECURITY.md + AGENTS.md Security section for scan-agent discoverability#55933
potiuk wants to merge 3 commits into
apache:masterfrom
potiuk:asf-security/discoverability-2026-05-18

Conversation

@potiuk
Copy link
Copy Markdown
Member

@potiuk potiuk commented May 17, 2026

This is a proposal for the PMC to review — please correct, reject, or discuss as needed. Nothing here is a requirement; the maintainer is the decision-maker.

This adds a SECURITY.md to the repo root and a Security section to the existing AGENTS.md so an automated scan agent can mechanically discover the project's security model via the conventional AGENTS.md → SECURITY.md → model URL chain. The chain terminates at the existing https://spark.apache.org/docs/latest/security.html page — nothing about the model content itself changes.

Context: the ASF Security team is preparing the project for an automated agentic security scan we're piloting. Such scans refuse to run if the model isn't discoverable by that path (refusing upfront beats wasting PMC reviewer cycles on a noise-heavy run against an unknown model). Discoverability is the one hard gate; everything else is suggestion. The Security team has reached out separately on the PMC's private list with the program details; this PR is the public-facing repo piece.

The Security team uses threat-model-producer as the rubric for what a complete model looks like — but this PR is just the link; the existing security.html content is accepted as the model.

After this lands on master, the same two files would need to be on branch-3.5 for the second scan target — happy to open a cherry-pick PR for that, or leave it to the PMC.

Questions / pushback welcome. Happy to adjust the wording or move the section if the project has a house style.

potiuk and others added 2 commits May 18, 2026 00:48
…bility

Adds a SECURITY.md to the repo root and a Security section at the end of
the existing AGENTS.md so an automated scan agent can mechanically
discover the project's security model via the conventional
AGENTS.md -> SECURITY.md -> model URL chain. The chain terminates at the
existing https://spark.apache.org/docs/latest/security.html document.

Generated-by: Claude Code (Claude Opus 4.7)
@HyukjinKwon
Copy link
Copy Markdown
Member

I think we should probably file a JIRA though.

@gatorsmile gatorsmile changed the title Add SECURITY.md + AGENTS.md Security section for scan-agent discoverability [SPARK-56998]Add SECURITY.md + AGENTS.md Security section for scan-agent discoverability May 21, 2026
@gatorsmile
Copy link
Copy Markdown
Member

Added a JIRA

Comment thread AGENTS.md Outdated
Comment thread SECURITY.md Outdated
Per the existing style in AGENTS.md (paragraphs are single
long lines; markdown renderers handle wrap), unwrap the added
Security section in AGENTS.md and the body paragraphs in
SECURITY.md.

Generated-by: Claude Code (Claude Opus 4.7)
HyukjinKwon pushed a commit that referenced this pull request May 22, 2026
…gent discoverability

**This is a proposal for the PMC to review — please correct, reject, or discuss as needed.** Nothing here is a requirement; the maintainer is the decision-maker.

This adds a `SECURITY.md` to the repo root and a `Security` section to the existing `AGENTS.md` so an automated scan agent can mechanically discover the project's security model via the conventional `AGENTS.md → SECURITY.md → model URL` chain. The chain terminates at the existing <https://spark.apache.org/docs/latest/security.html> page — nothing about the model content itself changes.

Context: the ASF Security team is preparing the project for an automated agentic security scan we're piloting. Such scans refuse to run if the model isn't discoverable by that path (refusing upfront beats wasting PMC reviewer cycles on a noise-heavy run against an unknown model). Discoverability is the one hard gate; everything else is suggestion. The Security team has reached out separately on the PMC's private list with the program details; this PR is the public-facing repo piece.

The Security team uses [`threat-model-producer`](https://gist.github.com/potiuk/da14a826283038ddfe38cc9fe6310573) as the rubric for what a complete model looks like — but this PR is just the *link*; the existing `security.html` content is accepted as the model.

After this lands on `master`, the same two files would need to be on `branch-3.5` for the second scan target — happy to open a cherry-pick PR for that, or leave it to the PMC.

Questions / pushback welcome. Happy to adjust the wording or move the section if the project has a house style.

Closes #55933 from potiuk/asf-security/discoverability-2026-05-18.

Lead-authored-by: Jarek Potiuk <jarek@potiuk.com>
Co-authored-by: Xiao Li <gatorsmile@gmail.com>
Signed-off-by: Hyukjin Kwon <gurwls223@apache.org>
(cherry picked from commit 411dedc)
Signed-off-by: Hyukjin Kwon <gurwls223@apache.org>
HyukjinKwon pushed a commit that referenced this pull request May 22, 2026
…gent discoverability

**This is a proposal for the PMC to review — please correct, reject, or discuss as needed.** Nothing here is a requirement; the maintainer is the decision-maker.

This adds a `SECURITY.md` to the repo root and a `Security` section to the existing `AGENTS.md` so an automated scan agent can mechanically discover the project's security model via the conventional `AGENTS.md → SECURITY.md → model URL` chain. The chain terminates at the existing <https://spark.apache.org/docs/latest/security.html> page — nothing about the model content itself changes.

Context: the ASF Security team is preparing the project for an automated agentic security scan we're piloting. Such scans refuse to run if the model isn't discoverable by that path (refusing upfront beats wasting PMC reviewer cycles on a noise-heavy run against an unknown model). Discoverability is the one hard gate; everything else is suggestion. The Security team has reached out separately on the PMC's private list with the program details; this PR is the public-facing repo piece.

The Security team uses [`threat-model-producer`](https://gist.github.com/potiuk/da14a826283038ddfe38cc9fe6310573) as the rubric for what a complete model looks like — but this PR is just the *link*; the existing `security.html` content is accepted as the model.

After this lands on `master`, the same two files would need to be on `branch-3.5` for the second scan target — happy to open a cherry-pick PR for that, or leave it to the PMC.

Questions / pushback welcome. Happy to adjust the wording or move the section if the project has a house style.

Closes #55933 from potiuk/asf-security/discoverability-2026-05-18.

Lead-authored-by: Jarek Potiuk <jarek@potiuk.com>
Co-authored-by: Xiao Li <gatorsmile@gmail.com>
Signed-off-by: Hyukjin Kwon <gurwls223@apache.org>
(cherry picked from commit 411dedc)
Signed-off-by: Hyukjin Kwon <gurwls223@apache.org>
HyukjinKwon pushed a commit that referenced this pull request May 22, 2026
…gent discoverability

**This is a proposal for the PMC to review — please correct, reject, or discuss as needed.** Nothing here is a requirement; the maintainer is the decision-maker.

This adds a `SECURITY.md` to the repo root and a `Security` section to the existing `AGENTS.md` so an automated scan agent can mechanically discover the project's security model via the conventional `AGENTS.md → SECURITY.md → model URL` chain. The chain terminates at the existing <https://spark.apache.org/docs/latest/security.html> page — nothing about the model content itself changes.

Context: the ASF Security team is preparing the project for an automated agentic security scan we're piloting. Such scans refuse to run if the model isn't discoverable by that path (refusing upfront beats wasting PMC reviewer cycles on a noise-heavy run against an unknown model). Discoverability is the one hard gate; everything else is suggestion. The Security team has reached out separately on the PMC's private list with the program details; this PR is the public-facing repo piece.

The Security team uses [`threat-model-producer`](https://gist.github.com/potiuk/da14a826283038ddfe38cc9fe6310573) as the rubric for what a complete model looks like — but this PR is just the *link*; the existing `security.html` content is accepted as the model.

After this lands on `master`, the same two files would need to be on `branch-3.5` for the second scan target — happy to open a cherry-pick PR for that, or leave it to the PMC.

Questions / pushback welcome. Happy to adjust the wording or move the section if the project has a house style.

Closes #55933 from potiuk/asf-security/discoverability-2026-05-18.

Lead-authored-by: Jarek Potiuk <jarek@potiuk.com>
Co-authored-by: Xiao Li <gatorsmile@gmail.com>
Signed-off-by: Hyukjin Kwon <gurwls223@apache.org>
(cherry picked from commit 411dedc)
Signed-off-by: Hyukjin Kwon <gurwls223@apache.org>
HyukjinKwon pushed a commit that referenced this pull request May 22, 2026
…gent discoverability

**This is a proposal for the PMC to review — please correct, reject, or discuss as needed.** Nothing here is a requirement; the maintainer is the decision-maker.

This adds a `SECURITY.md` to the repo root and a `Security` section to the existing `AGENTS.md` so an automated scan agent can mechanically discover the project's security model via the conventional `AGENTS.md → SECURITY.md → model URL` chain. The chain terminates at the existing <https://spark.apache.org/docs/latest/security.html> page — nothing about the model content itself changes.

Context: the ASF Security team is preparing the project for an automated agentic security scan we're piloting. Such scans refuse to run if the model isn't discoverable by that path (refusing upfront beats wasting PMC reviewer cycles on a noise-heavy run against an unknown model). Discoverability is the one hard gate; everything else is suggestion. The Security team has reached out separately on the PMC's private list with the program details; this PR is the public-facing repo piece.

The Security team uses [`threat-model-producer`](https://gist.github.com/potiuk/da14a826283038ddfe38cc9fe6310573) as the rubric for what a complete model looks like — but this PR is just the *link*; the existing `security.html` content is accepted as the model.

After this lands on `master`, the same two files would need to be on `branch-3.5` for the second scan target — happy to open a cherry-pick PR for that, or leave it to the PMC.

Questions / pushback welcome. Happy to adjust the wording or move the section if the project has a house style.

Closes #55933 from potiuk/asf-security/discoverability-2026-05-18.

Lead-authored-by: Jarek Potiuk <jarek@potiuk.com>
Co-authored-by: Xiao Li <gatorsmile@gmail.com>
Signed-off-by: Hyukjin Kwon <gurwls223@apache.org>
(cherry picked from commit 411dedc)
Signed-off-by: Hyukjin Kwon <gurwls223@apache.org>
HyukjinKwon pushed a commit that referenced this pull request May 22, 2026
…gent discoverability

**This is a proposal for the PMC to review — please correct, reject, or discuss as needed.** Nothing here is a requirement; the maintainer is the decision-maker.

This adds a `SECURITY.md` to the repo root and a `Security` section to the existing `AGENTS.md` so an automated scan agent can mechanically discover the project's security model via the conventional `AGENTS.md → SECURITY.md → model URL` chain. The chain terminates at the existing <https://spark.apache.org/docs/latest/security.html> page — nothing about the model content itself changes.

Context: the ASF Security team is preparing the project for an automated agentic security scan we're piloting. Such scans refuse to run if the model isn't discoverable by that path (refusing upfront beats wasting PMC reviewer cycles on a noise-heavy run against an unknown model). Discoverability is the one hard gate; everything else is suggestion. The Security team has reached out separately on the PMC's private list with the program details; this PR is the public-facing repo piece.

The Security team uses [`threat-model-producer`](https://gist.github.com/potiuk/da14a826283038ddfe38cc9fe6310573) as the rubric for what a complete model looks like — but this PR is just the *link*; the existing `security.html` content is accepted as the model.

After this lands on `master`, the same two files would need to be on `branch-3.5` for the second scan target — happy to open a cherry-pick PR for that, or leave it to the PMC.

Questions / pushback welcome. Happy to adjust the wording or move the section if the project has a house style.

Closes #55933 from potiuk/asf-security/discoverability-2026-05-18.

Lead-authored-by: Jarek Potiuk <jarek@potiuk.com>
Co-authored-by: Xiao Li <gatorsmile@gmail.com>
Signed-off-by: Hyukjin Kwon <gurwls223@apache.org>
(cherry picked from commit 411dedc)
Signed-off-by: Hyukjin Kwon <gurwls223@apache.org>
@HyukjinKwon
Copy link
Copy Markdown
Member

Merged to master, branch-4.x, branch-4.2, branch-4.1, branch-4.0 and branch-3.5.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants